noisy data
Cutting Through the Noise: On-the-fly Outlier Detection for Robust Training of Machine Learning Interatomic Potentials
Lam, Terry C. W., O'Neill, Niamh, Schran, Christoph, Schaaf, Lars L.
The accuracy of machine learning interatomic potentials suffers when the reference data contain numerical noise. Often originating from unconverged or inconsistent electronic-structure calculations, this noise is challenging to identify. Existing mitigation strategies, such as manual filtering or iterative refinement of outliers, require either substantial expert effort or multiple expensive retraining cycles, making them difficult to scale to large datasets. Here, we introduce an on-the-fly outlier detection scheme that automatically down-weights noisy samples without requiring additional reference calculations. By tracking the loss distribution via an exponential moving average, this unsupervised method identifies outliers throughout a single training run. We show that this approach prevents overfitting and matches the performance of iterative-refinement baselines at significantly reduced overhead. The method's effectiveness is demonstrated by recovering accurate physical observables for liquid water, including diffusion coefficients, from unconverged reference data. Furthermore, we validate its scalability by training a foundation model for organic chemistry on the SPICE dataset, where it reduces energy errors by a factor of three. This framework provides a simple, automated solution for training robust models on imperfect datasets across a range of dataset sizes.
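The scheme described above lends itself to a compact sketch. Below is a minimal, illustrative PyTorch version of EMA-based loss tracking with hard down-weighting of high-loss samples; the decay, the threshold multiplier `k`, and the zero/one weighting are our assumptions, not the authors' exact implementation.

```python
import torch


class EMAOutlierDownweighter:
    """Track the per-sample loss distribution with an exponential moving average
    and down-weight samples whose loss sits far above the running mean.
    The decay and the threshold multiplier `k` are illustrative choices."""

    def __init__(self, decay=0.99, k=3.0):
        self.decay = decay
        self.k = k
        self.mean = None
        self.var = None

    def weights(self, per_sample_loss):
        detached = per_sample_loss.detach()
        batch_mean = detached.mean()
        batch_var = detached.var(unbiased=False)
        if self.mean is None:  # first batch initialises the running statistics
            self.mean, self.var = batch_mean, batch_var
        else:
            self.mean = self.decay * self.mean + (1 - self.decay) * batch_mean
            self.var = self.decay * self.var + (1 - self.decay) * batch_var
        threshold = self.mean + self.k * self.var.clamp_min(1e-12).sqrt()
        # samples above the EMA-based threshold are treated as outliers and get zero weight
        return (detached <= threshold).float()


# Sketch of use inside a training step (model, optimizer, and a loss_fn with
# reduction='none' are assumed to exist):
#   per_sample_loss = loss_fn(model(x), y)
#   w = downweighter.weights(per_sample_loss)
#   loss = (w * per_sample_loss).sum() / w.sum().clamp_min(1.0)
#   loss.backward(); optimizer.step()
```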
Deep Exploration of Epoch-wise Double Descent in Noisy Data: Signal Separation, Large Activation, and Benign Overfitting
Kubo, Tomoki, Uda, Ryuken, Iida, Yusuke
Deep double descent is one of the key phenomena underlying the generalization capability of deep learning models. In this study, epoch-wise double descent, i.e., delayed generalization following overfitting, was empirically investigated by focusing on the evolution of internal structures. Fully connected neural networks of three different sizes were trained on the CIFAR-10 dataset with 30% label noise. By decomposing the loss curves into signal contributions from clean and noisy training data, the epoch-wise evolution of internal signals was analyzed separately. Three main findings were obtained from this analysis. First, the models achieved strong re-generalization on test data even after perfectly fitting the noisy training data during the double-descent phase, corresponding to a "benign overfitting" state. Second, noisy data were learned after clean data, and as learning progressed, their corresponding internal activations became increasingly separated in the outer layers; this enabled the models to overfit only the noisy data. Third, a single, very large activation emerged in the shallow layer across all models; this phenomenon, referred to as "outliers," "massive activations," or "super activations" in recent large language models, evolves with re-generalization. These empirical findings directly link the recent key phenomena of "deep double descent," "benign overfitting," and "large activation," and support the proposal of a novel scenario for understanding deep double descent. Artificial intelligence technologies have undergone remarkable development in recent years, introducing substantial transformation to social structures and influencing various academic fields. Although deep learning models form the core of such technologies, the fundamental principles underlying their high generalization capability when trained on real-world data remain poorly understood. Recent numerical experiments have empirically revealed various intriguing phenomena related to this gap.
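A short sketch of the clean/noisy loss decomposition used in this kind of analysis, assuming the indices of the synthetically flipped labels are known (as they are when 30% label noise is injected); the data-loader convention and function names are illustrative, not the authors' code.

```python
import torch


@torch.no_grad()
def decompose_loss(model, loader, noisy_idx,
                   loss_fn=torch.nn.CrossEntropyLoss(reduction="none")):
    """Split the training loss into contributions from clean and label-noised samples.
    `noisy_idx` holds the dataset indices whose labels were flipped, which is known
    because the label noise is injected synthetically. The loader is assumed to
    yield (inputs, labels, dataset indices)."""
    clean_parts, noisy_parts = [], []
    for x, y, idx in loader:
        per_sample = loss_fn(model(x), y)
        is_noisy = torch.tensor([int(i) in noisy_idx for i in idx])
        clean_parts.append(per_sample[~is_noisy])
        noisy_parts.append(per_sample[is_noisy])
    clean = torch.cat(clean_parts).mean().item()
    noisy = torch.cat(noisy_parts).mean().item()
    return clean, noisy  # track both per epoch to see noisy data being fit after clean data
```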
Learning the Latent Causal Structure for Modeling Label Noise
In label-noise learning, the noise transition matrix reveals how an instance transitions from its clean label to its noisy label. Accurately estimating an instance's noise transition matrix is crucial for estimating its clean label. However, when only a noisy dataset is available, noise transition matrices can be estimated only for some special instances. To leverage these estimated transition matrices to help estimate the transition matrices of other instances, it is essential to explore relations between the matrices of these special instances and those of others. Existing studies typically build the relation by explicitly defining the similarity between the estimated noise transition matrices of special instances and those of other instances.
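As a worked illustration of what a noise transition matrix encodes (this small example is ours, not the paper's): the noisy-label posterior is the clean-label posterior pushed through the matrix, so an estimate of the matrix lets one map back to clean labels.

```python
import numpy as np

# T[i, j] = P(noisy label = j | clean label = i, x) for a hypothetical 3-class instance x.
T = np.array([[0.9, 0.1, 0.0],
              [0.2, 0.7, 0.1],
              [0.0, 0.1, 0.9]])

p_clean = np.array([0.1, 0.8, 0.1])   # P(clean label | x)
p_noisy = T.T @ p_clean               # P(noisy label | x) = T^T P(clean label | x)

# Given an estimate of T, a clean posterior can be recovered from a model fit on
# noisy labels (exactly here, up to estimation error in practice).
p_clean_recovered = np.linalg.solve(T.T, p_noisy)
print(p_noisy, p_clean_recovered)
```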
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods
We present the Noisy Ostracods, a noisy dataset for genus and species classification of crustacean ostracods with specialists' annotations. Of the 71,466 specimens collected, 5.58% are estimated to be noisy (possibly problematic) at the genus level. The dataset was created to address a real-world challenge: creating a clean fine-grained taxonomy dataset. The Noisy Ostracods dataset has diverse noise from multiple sources. First, the noise is open-set, including new classes discovered during curation that were not part of the original annotation. The dataset also has pseudo-classes, where annotators misclassified samples that should belong to an existing class into a new pseudo-class.
Training shallow ReLU networks on noisy data using hinge loss: when do we overfit and is it benign?
We study benign overfitting in two-layer ReLU networks trained using gradient descent and hinge loss on noisy data for binary classification. In particular, we consider linearly separable data for which a relatively small proportion of labels are corrupted or flipped. We identify conditions on the margin of the clean data that give rise to three distinct training outcomes: benign overfitting, in which zero loss is achieved and with high probability test data is classified correctly; overfitting, in which zero loss is achieved but test data is misclassified with probability lower bounded by a constant; and non-overfitting, in which clean points, but not corrupt points, achieve zero loss and again with high probability test data is classified correctly. Our analysis provides a fine-grained description of the dynamics of neurons throughout training and reveals two distinct phases: in the first phase clean points achieve close to zero loss, in the second phase clean points oscillate on the boundary of zero loss while corrupt points either converge towards zero loss or are eventually zeroed by the network. We prove these results using a combinatorial approach that involves bounding the number of clean versus corrupt updates during these phases of training.
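The setting analysed above is easy to reproduce empirically. The sketch below (ours, with illustrative dimensions, learning rate, and a fixed ±1 second layer) trains a two-layer ReLU network with hinge loss on linearly separable data in which a small fraction of labels has been flipped.

```python
import torch

torch.manual_seed(0)
n, d, width, flip_frac = 200, 20, 100, 0.05

# Linearly separable data: the clean label is the sign of the first coordinate.
X = torch.randn(n, d)
y = torch.sign(X[:, 0])
flip = torch.rand(n) < flip_frac      # corrupt a small fraction of labels
y[flip] = -y[flip]

W = (0.1 * torch.randn(width, d)).requires_grad_()  # trainable first layer
a = torch.sign(torch.randn(width))                  # fixed +/-1 second layer

def net(X):
    return torch.relu(X @ W.T) @ a / width

opt = torch.optim.SGD([W], lr=0.1)
for step in range(2000):
    loss = torch.clamp(1.0 - y * net(X), min=0.0).mean()  # hinge loss
    opt.zero_grad()
    loss.backward()
    opt.step()

# If the flipped labels are also fit perfectly, the network has (possibly benignly) overfit them.
print("train acc:", (torch.sign(net(X)) == y).float().mean().item())
print("acc on flipped points:", (torch.sign(net(X))[flip] == y[flip]).float().mean().item())
```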
Nonconvex Low-Rank Tensor Completion from Noisy Data
We study a completion problem of broad practical interest: the reconstruction of a low-rank symmetric tensor from highly incomplete and randomly corrupted observations of its entries. While a variety of prior work has been dedicated to this problem, prior algorithms either are computationally too expensive for large-scale applications, or come with sub-optimal statistical guarantees. Focusing on ``incoherent'' and well-conditioned tensors of a constant CP rank, we propose a two-stage nonconvex algorithm --- (vanilla) gradient descent following a rough initialization --- that achieves the best of both worlds. Specifically, the proposed nonconvex algorithm faithfully completes the tensor and retrieves all low-rank tensor factors within nearly linear time, while at the same time enjoying near-optimal statistical guarantees (i.e.~minimal sample complexity and optimal $\ell_2$ and $\ell_{\infty}$ statistical accuracy). The insights conveyed through our analysis of nonconvex optimization might have implications for other tensor estimation problems.
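A highly simplified sketch of the second stage described above: gradient descent on a shared symmetric CP factor, restricted to the observed entries. The random initialization, step size, and iteration count are illustrative; the paper's spectral initialization is what underpins its recovery guarantees, and a random start need not recover the tensor.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r, p, sigma = 30, 3, 0.3, 0.1      # dimension, CP rank, sampling rate, noise level

U_true = rng.standard_normal((d, r))
T = np.einsum('il,jl,kl->ijk', U_true, U_true, U_true)   # symmetric rank-r ground truth
mask = rng.random((d, d, d)) < p                          # randomly observed entries
Y = mask * (T + sigma * rng.standard_normal(T.shape))     # noisy, incomplete observations

U = rng.standard_normal((d, r))       # crude random start (the paper uses a spectral initialization)
m = mask.sum()
lr = 0.2
for it in range(2000):
    R = mask * (np.einsum('il,jl,kl->ijk', U, U, U) - Y)  # residual on observed entries only
    # exact gradient of (0.5 / m) * ||R||_F^2 with respect to the shared factor U
    G = (np.einsum('ijk,jl,kl->il', R, U, U)
         + np.einsum('ijk,il,kl->jl', R, U, U)
         + np.einsum('ijk,il,jl->kl', R, U, U)) / m
    U -= lr * G

err = np.linalg.norm(np.einsum('il,jl,kl->ijk', U, U, U) - T) / np.linalg.norm(T)
print(f"relative reconstruction error: {err:.3f}")
```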
Confidence-based Reliable Learning under Dual Noises
Deep neural networks (DNNs) have achieved remarkable success in a variety of computer vision tasks, where massive labeled images are routinely required for model optimization. Yet, data collected from the open world are unavoidably polluted by noise, which may significantly undermine the efficacy of the learned models. Various attempts have been made to train DNNs reliably under data noise, but they account separately for either the noise in the labels or the noise in the images. A naive combination of the two lines of work would suffer from the limitations of both and miss the opportunity to handle the two kinds of noise in parallel. This work provides the first unified framework for reliable learning under joint (image, label) noise. Technically, we develop a confidence-based sample filter that progressively filters out noisy data without the need to pre-specify a noise ratio. We then penalize the model uncertainty of the detected noisy data instead of letting the model continue to over-fit the misleading information in them. Experimental results on various challenging synthetic and real-world noisy datasets verify that the proposed method outperforms competing baselines in terms of classification performance.
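One possible reading of such a confidence-based filter with an uncertainty term, sketched below in PyTorch: samples the model is confident about are fit with cross-entropy, while suspected-noisy samples are pushed toward high-entropy predictions instead of their possibly corrupted labels. The threshold schedule, the direction of the uncertainty term, and the penalty weight are our illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn.functional as F


def reliable_loss(logits, targets, epoch, max_epoch, penalty_weight=0.1):
    """Cross-entropy on samples the model is confident about, plus a regulariser on the
    rest. The threshold schedule, the high-entropy target for suspected-noisy samples,
    and the penalty weight are illustrative assumptions."""
    probs = F.softmax(logits, dim=1)
    confidence = probs.gather(1, targets.unsqueeze(1)).squeeze(1)  # P(assigned label | x)
    # progressive filtering: the confidence bar rises as training proceeds,
    # so no noise ratio needs to be specified in advance
    threshold = 0.5 * min(1.0, epoch / max(1, max_epoch // 2))
    clean_mask = confidence >= threshold

    ce = F.cross_entropy(logits, targets, reduction="none")
    clean_loss = ce[clean_mask].sum() / clean_mask.sum().clamp_min(1)

    # suspected-noisy samples: push predictions toward high entropy instead of fitting their labels
    entropy = -(probs * probs.clamp_min(1e-12).log()).sum(dim=1)
    noisy_mask = ~clean_mask
    uncertainty_term = (-entropy[noisy_mask]).sum() / noisy_mask.sum().clamp_min(1)

    return clean_loss + penalty_weight * uncertainty_term
```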